Identifying genetic interactions in genome-wide data using Bayesian networks.
نویسندگان
چکیده
It is believed that interactions among genes (epistasis) may play an important role in susceptibility to common diseases (Moore and Williams [2002]. Ann Med 34:88-95; Ritchie et al. [2001]. Am J Hum Genet 69:138-147). To study the underlying genetic variants of diseases, genome-wide association studies (GWAS) that simultaneously assay several hundreds of thousands of SNPs are being increasingly used. Often, the data from these studies are analyzed with single-locus methods (Lambert et al. [2009]. Nat Genet 41:1094-1099; Reiman et al. [2007]. Neuron 54:713-720). However, epistatic interactions may not be easily detected with single-locus methods (Marchini et al. [2005]. Nat Genet 37:413-417). As a result, both parametric and nonparametric multi-locus methods have been developed to detect such interactions (Heidema et al. [2006]. BMC Genet 7:23). However, efficiently analyzing epistasis using high-dimensional genome-wide data remains a crucial challenge. We develop a method based on Bayesian networks and the minimum description length principle for detecting epistatic interactions. We compare its ability to detect gene-gene interactions and its efficiency to that of the combinatorial method multifactor dimensionality reduction (MDR) using 28,000 simulated data sets generated from 70 different genetic models We further apply the method to over 300,000 SNPs obtained from a GWAS involving late onset Alzheimer's disease (LOAD). Our method outperforms MDR and we substantiate previous results indicating that the GAB2 gene is associated with LOAD. To our knowledge, this is the first successful model-based epistatic analysis using a high-dimensional genome-wide data set.
منابع مشابه
The Impact of Different Genetic Architectures on Accuracy of Genomic Selection Using Three Bayesian Methods
Genome-wide evaluation uses the associations of a large number of single nucleotide polymorphism (SNP) markers across the whole genome and then combines the statistical methods with genomic data to predict the genetic values. Genomic predictions relieson linkage disequilibrium (LD) between genetic markers and quantitative trait loci (QTL) in a population. Methods that use all markers simultaneo...
متن کاملProject Portfolio Risk Response Selection Using Bayesian Belief Networks
Risk identification, impact assessment, and response planning constitute three building blocks of project risk management. Correspondingly, three types of interactions could be envisioned between risks, between impacts of several risks on a portfolio component, and between several responses. While the interdependency of risks is a well-recognized issue, the other two types of interactions remai...
متن کاملLoad-Frequency Control: a GA based Bayesian Networks Multi-agent System
Bayesian Networks (BN) provides a robust probabilistic method of reasoning under uncertainty. They have been successfully applied in a variety of real-world tasks but they have received little attention in the area of load-frequency control (LFC). In practice, LFC systems use proportional-integral controllers. However since these controllers are designed using a linear model, the nonlinearities...
متن کاملBayesian Exploration of Multilocus Interactions on the Genome-Wide Scale
Problem statement: Recent technological and scientific advances propelled the field of Genome-Wide Association Study (GWAS), which promises to be instrumental in linking many common complex diseases to their genetic origin. While so far such large-scale surveys have been moderately successful in identifying disease related genetic variants, much of disease heritability is still not accounted fo...
متن کاملDiscovering Alzheimer Genetic Biomarkers Using Bayesian Networks
Single nucleotide polymorphisms (SNPs) contribute most of the genetic variation to the human genome. SNPs associate with many complex and common diseases like Alzheimer's disease (AD). Discovering SNP biomarkers at different loci can improve early diagnosis and treatment of these diseases. Bayesian network provides a comprehensible and modular framework for representing interactions between gen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Genetic epidemiology
دوره 34 6 شماره
صفحات -
تاریخ انتشار 2010